Corpus: isl-is_web_2020_30K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 95 98 99 99 99
1000 904 979 993 997 998
10000 6602 9042 9779 9911 9929
100000 16303 25430 28920 29651 29753
1000000 16303 25430 28920 29651 29753


Zipf's diagram for sentence endings


Gnuplot diagram

3022 msec needed at 2021-08-07 18:02